NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning Generalizable Tool-use Skills through Trajectory Generation

Qi, Carl; Wu, Yilin; Yu, Lifan; Liu, Haoyue; Jiang, Bowen; Lin, Xingyu; Held, David (September 2024, arxiv.org)

Autonomous systems that efficiently utilize tools can assist humans in completing many common tasks such as cooking and cleaning. However, current systems fall short of matching human-level of intelligence in terms of adapting to novel tools. Prior works based on affordance often make strong assumptions about the environments and cannot scale to more complex, contact-rich tasks. In this work, we tackle this challenge and explore how agents can learn to use previously unseen tools to manipulate deformable objects. We propose to learn a generative model of the tool-use trajectories as a sequence of tool point clouds, which generalizes to different tool shapes. Given any novel tool, we first generate a tool-use trajectory and then optimize the sequence of tool poses to align with the generated trajectory. We train a single model on four different challenging deformable object manipulation tasks, using demonstration data from only one tool per task. The model generalizes to various novel tools, significantly outperforming baselines. We further test our trained policy in the real world with unseen tools, where it achieves the performance comparable to human.
more » « less
Full Text Available
Learning Closed-Loop Dough Manipulation Using a Differentiable Reset Module

https://doi.org/10.1109/LRA.2022.3191239

Qi, Carl; Lin, Xingyu; Held, David (October 2022, IEEE Robotics and Automation Letters)

Full Text Available
Self-supervised Transparent Liquid Segmentation for Robotic Pouring

https://doi.org/10.1109/ICRA46639.2022.9812000

Narasimhan, Gautham; Zhang, Kai; Eisner, Ben; Lin, Xingyu; Held, David (May 2022, Self-supervised Transparent Liquid Segmentation for Robotic Pouring)

Full Text Available
Mesh-based Dynamics with Occlusion Reasoning for Cloth Manipulation

https://doi.org/10.15607/RSS.2022.XVIII.011

Huang, Zixuan; Lin, Xingyu; Held, David (January 2022, Robotics: Science and Systems (RSS))

Full Text Available
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools

Lin, Xingyu; Huang, Zhiao; Li, Yunzhu; Tenenbaum, Joshua B.; Held, David; Gan, Chuang (January 2022, International Conference on Learning Representations (ICLR))

We consider the problem of sequential robotic manipulation of deformable objects using tools. Previous works have shown that differentiable physics simulators provide gradients to the environment state and help trajectory optimization to converge orders of magnitude faster than model-free reinforcement learning algorithms for deformable object manipulation. However, such gradient-based trajectory optimization typically requires access to the full simulator states and can only solve short-horizon, single-skill tasks due to local optima. In this work, we propose a novel framework, named DiffSkill, that uses a differentiable physics simulator for skill abstraction to solve long-horizon deformable object manipulation tasks from sensory observations. In particular, we first obtain short-horizon skills using individual tools from a gradient-based optimizer, using the full state information in a differentiable simulator; we then learn a neural skill abstractor from the demonstration trajectories which takes RGBD images as input. Finally, we plan over the skills by finding the intermediate goals and then solve long-horizon tasks. We show the advantages of our method in a new set of sequential deformable object manipulation tasks compared to previous reinforcement learning algorithms and compared to the trajectory optimizer.
more » « less
Full Text Available
Planning with Spatial-Temporal Abstraction from Point Clouds for Deformable Object Manipulation

Lin, Xingyu; Qi, Carl; Zhang, Yunchu; Huang, Zhiao; Fragkiadaki, Katerina; Li, Yunzhu; Gan, Chuang; Held, David (January 2022, Conference on Robot Learning (CoRL))

Full Text Available
Learning Visible Connectivity Dynamics for Cloth Smoothing

Lin, Xingyu; Wang, Yufei; Huang, Zixuan; Held, David (January 2021, Conference on Robot Learning)

Robotic manipulation of cloth remains challenging due to the complex dynamics of cloth, lack of a low-dimensional state representation, and self-occlusions. In contrast to previous model-based approaches that learn a pixel-based dynamics model or a compressed latent vector dynamics, we propose to learn a particle-based dynamics model from a partial point cloud observation. To overcome the challenges of partial observability, we infer which visible points are connected on the underlying cloth mesh. We then learn a dynamics model over this visible connectivity graph. Compared to previous learning-based approaches, our model poses strong inductive bias with its particle based representation for learning the underlying cloth physics; it can generalize to cloths with novel shapes; it is invariant to visual features; and the predictions can be more easily visualized. We show that our method greatly outperforms previous state-of-the-art model-based and model-free reinforcement learning methods in simulation. Furthermore, we demonstrate zero-shot sim-to-real transfer where we deploy the model trained in simulation on a Franka arm and show that the model can successfully smooth cloths of different materials, geometries and colors from crumpled configurations.
more » « less
Full Text Available
SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation

Lin, Xingyu; Wang, Yufei; Olkin, Jake; Held, David (January 2020, Conference on Robot Learning)
null (Ed.)
Manipulating deformable objects has long been a challenge in robotics due to its high dimensional state representation and complex dynamics. Recent success in deep reinforcement learning provides a promising direction for learning to manipulate deformable objects with data driven methods. However, existing reinforcement learning benchmarks only cover tasks with direct state observability and simple low-dimensional dynamics or with relatively simple image-based environments, such as those with rigid objects. In this paper, we present SoftGym, a set of open-source simulated benchmarks for manipulating deformable objects, with a standard OpenAI Gym API and a Python interface for creating new environments. Our benchmark will enable reproducible research in this important area. Further, we evaluate a variety of algorithms on these tasks and highlight challenges for reinforcement learning algorithms, including dealing with a state representation that has a high intrinsic dimensionality and is partially observable. The experiments and analysis indicate the strengths and limitations of existing methods in the context of deformable object manipulation that can help point the way forward for future methods development.
more » « less
Full Text Available
Adaptive Auxiliary Task Weighting for Reinforcement Learning

Lin, Xingyu; Baweja, Harjatin Singh; Kantor, George; Held, David (December 2019, Advances in neural information processing systems)

Reinforcement learning is known to be sample inefficient, preventing its application to many real-world problems, especially with high dimensional observations like images. Transferring knowledge from other auxiliary tasks is a powerful tool for improving the learning efficiency. However, the usage of auxiliary tasks has been limited so far due to the difficulty in selecting and combining different auxiliary tasks. In this work, we propose a principled online learning algorithm that dynam- ically combines different auxiliary tasks to speed up training for reinforcement learning. Our method is based on the idea that auxiliary tasks should provide gradient directions that, in the long term, help to decrease the loss of the main task. We show in various environments that our algorithm can effectively combine a variety of different auxiliary tasks and achieves significant speedup compared to previous heuristic approaches of adapting auxiliary task weights.
more » « less
Full Text Available
Visual Self-Supervised Reinforcement Learning with Object Reasoning

Wang, Yufei; Narasimhan, Gautham Narayan; Lin, Xingyu; Okorn, Brian; Held, David (January 2020, Conference on Robot Learning)
null (Ed.)
Current image-based reinforcement learning (RL) algorithms typically operate on the whole image without performing object-level reasoning. This leads to inefficient goal sampling and ineffective reward functions. In this paper, we improve upon previous visual self-supervised RL by incorporating object-level reasoning and occlusion reasoning. Specifically, we use unknown object segmentation to ignore distractors in the scene for better reward computation and goal generation; we further enable occlusion reasoning by employing a novel auxiliary loss and training scheme. We demonstrate that our proposed algorithm, ROLL (Reinforcement learning with Object Level Learning), learns dramatically faster and achieves better final performance compared with previous methods in several simulated visual control tasks.
more » « less
Full Text Available

« Prev Next »

Search for: All records